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We discuss upper bounds on the rate at which unitary evolution governed by a non-local 
Hamiltonian can generate entanglement in a bipartite system. Given a bipartite Hamiltonian 
H coupling two finite dimensional particles A and B, the entangling rate is shown to be upper 
bounded by c log (d) \\H\\, where d is the smallest dimension of the interacting particles, ||-£f || is 
the operator norm of H, and c is a constant close to 1. Under certain restrictions on the initial 
state we prove analogous upper bound for the ancilla-assisted entangling rate with a constant 
c that does not depend upon dimensions of local ancillas. The restriction is that the initial 
state has at most two distinct Schmidt coefficients (each coefficient may have arbitrarily large 
multiplicity) . Our proof is based on analysis of a mixing rate — a functional measuring how fast 
entropy can be produced if one mixes a time-independent state with a state evolving unitarily. 

1 Introduction 

Consider two remote parties Alice and Bob controlling finite-dimensional quantum systems A 
and B. Suppose A and B interact with each other according to a time- independent Hamiltonian 
H. Unless H is a sum of local Hamiltonians, a unitary evolution e tHt is a non-local operation 
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q ■ capable of creating entanglement between A and B. The goal of the present paper is to get 
an upper bound on the rate at which the entanglement between Alice and Bob can increase or 
decrease as a function of time. 

We consider ancilla-assisted entangling, see Figured! It means that Alice's laboratory con- 
sists of two subsystems: A and a, such that the Hamiltonian H acts only on the subsystem A. 
Similarly, Bob's laboratory is partitioned into B and 6, where H acts only on the subsystem 
B. The subsystems a and b are the local ancillas held by Alice and Bob. In the ancilla-assisted 
entangling Alice and Bob start from a pure state of the composite system aABb which may 
be already entangled. 

Time evolution of the composite system aABb is described by a unitary operator U(t) = 
I a <S> e lHABt <S> lb- Thus the joint state of Alice and Bob always remains pure. Accordingly, the 
entanglement between Alice and Bob at any time t can be quantified by entanglement entropy 

S(aA) = -Tr p aA {t) log p aA (t), p aA {t) = Tr B6 U(t) |*)(*| U(tf. (1) 

The quantity we are interested in is entangling rate 

dS(aA) 



dt 



(2) 
t=o 



Understanding properties of the entangling rate is crucial for optimal generation of entangle- 
ment [I], computing capacities of bidirectional quantum communication channels [2], and for 
describing dynamics of entanglement in quantum spin lattice models [3]. 
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Ancilla-assisted entangling. 



Our goal is to get an upper bound on T(^f,H) that would not explicitly depend upon 
dimensions of the local ancillas a and b. Upper bounds of this kind can be easily generalized 
via the Trotter decomposition to arbitrary multipartite Hamiltonians decomposable into a sum 
of few-party interactions. 

1.1 Previous work 

The maximal entangling rate T(H) = max^ r(\I / , H) has been studied by many authors for the 
case when A and B are qubits. The optimal state maximizing T(^f,H) has been found by 
Diir, Vidal et al [1] for a general Hamiltonian H assuming a = b = 1. These authors have 
also observed that for some Hamiltonians local ancillas are capable of increasing the maximal 
entangling rate T(H). A powerful technique of getting upper bounds on T(H) for arbitrarily 
large a and b was proposed by Childs, Leung et al [4]. It was used to identify a subclass of 
two-qubit Hamiltonians, including the Ising interaction, for which the maximum T(H) does not 
depend upon a and b (and thus can be achieved without local ancillas). An upper bound on 
the entangling rate of the Ising interaction has been also obtained by Cirac, Diir et al. [5] by 
showing that the time evolution with the Ising Hamiltonian for time t can be implemented by 
LOCC protocol consuming 0(|i|) e-bits of pre-shared entanglement. 

Some progress has been also achieved for larger dimensions of A and B. The technique 
of [4] has been generalized by Wang and Sanders pj to prove that T(H) < (3 ~ 1.9123 for any 
product Hamiltonian H = Ha <8> Hb if Ha,Hb have eigenvalues ±1. This result applies to 
arbitrary dimensions of a and b. An upper bound T(H) < (3\\H\\ for the ancilla-assisted case 
has been proved for arbitrary bipartite product Hamiltonians by Childs, Leung, and Vidal [7]. 
Finally, it was shown by Bennett, Harrow et al. [2] that T(H) < cd A \\H\\, where d = min (A, B) 
and c = O(l) does not depend upon a and b. This result relies on a decomposition of an 
arbitrary bipartite Hamiltonian into a sum of product Hamiltonians. Thus for fixed d and 
\\H\\ the entangling rate has a constant upper bound independent on how large are dimensions 
a and b. The authors of [2] also proved that the supremum of T{H) over all dimensions of 
local ancillas a, b coincides with the asymptotic capacity of H to generate entanglement by any 
protocol in which unitary evolution with H is interspersed with LOCC. It is unknown whether 
the supremum over a and b can be actually achieved for finite dimensional ancillas. 

It is not known whether T(H) = T(—H), i.e., whether the maximal entangling and disentan- 
gling rates of a given Hamiltonian H coincide. The results by Linden, Smolin, and Winter [8] 
applicable to finite unitary operators suggest that this equality might be wrong. 

1.2 Summary of results 

We start from observing that the unitary evolution with any Hamiltonian Hab can not increase 
(decrease) the entanglement S(aA) by more than 2 log(<i), where d = min (A,B), see [2]. This 
property can be called small total entangling, as it says that the total increase (decrease) of 
entanglement throughout the unitary evolution remains bounded as the dimensions of local 
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ancillas a and b go to infinity. Is there an analogue of the small total entangling property for 
infinitely small time intervals? The following conjecture was initially proposed by Kitaev [9]. 
We call it small incremental entangling. 

Small Incremental Entangling (SIE): There exists a constant c = 0(1) such that 

T(V,H) <c\\H\\ log(d), d = min(A,B) (3) 
for all dimensions a, b and for all states \*&) of a composite system aABb. 

It is not known whether SIE is true or false. Our first results is a proof of SIE for the 
special case when has at most two distinct Schmidt coefficients with respect to a partition 
aA®Bb (each Schmidt coefficient may have arbitrarily large multiplicity). Our proof yields the 
constant c = 24 although the actual value of c might be much smaller. The constraint on |\&) 
serves technical purposes and was introduced in order to make the problem tractable. It should 
be mentioned that the upper bound Eq. ([3]) can be confirmed by straightforward calculation if 
a = b = 1 (no local ancillas), see Section [2j In this case the optimal pair ($>,H) can be found 
explicitly. It turns out that the optimal state | V I / ) has only two distinct Schmidt coefficients and 
thus falls into the category that we consider, see Section [2] for details. We don't know whether 
the optimal state has only two Schmidt coefficients in the ancilla-assisted case. 

It is known that in some cases local ancillas and pre-shared entanglement can lead to counter- 
intuitive effects, such as entanglement embezzling |10j or locking of classical correlations In 
particular, it was demonstrated by DiVincenzo, Horodecki et al. [11] that sending a single qubit 
from Alice to Bob can increase their classical mutual information by an arbitrarily large amount 
in the presence of local ancillas. Thus SIE may be violated if locking effects also occur in the 
infinitesimal unitary transformations if one measures correlations by entanglement entropy. For 
future references let us give an explicit expression for the entangling rate that can be easily 
obtained by computing the derivative in Eq. ([2]). 

T(*,H) = -%Tr (I a <S> Hab[PoAB, log PoA®Ib]). (4) 

Our proof of SIE goes by getting an upper bound on a mixing rate. In order to define a 
mixing rate, consider a probabilistic ensemble of (mixed) states £ = {p a , p a }a=o,i defined on a 
finite-dimensional Hilbert space (lacking any tensor product structure). Let p = Po Po + Pi Pi 
be the average state corresponding to £. For any Hamiltonian H define a time dependent state 
p(t) = po Po + Pi e tHt p\ e~ lHt . Define a mixing rate as 

A(£ ,„ )= « . (5, 

at t=0 

Here S(p(t)) is the von Neumann entropy of p(t). Basic properties of the von Neumann entropy 
imply that S < S(p(t)) < S + h(po,pi), where S = po S(po) +p\ S(p\) is the average entropy for 
the ensemble £ and h(po,pi) = —po log (po) —p\ log (pi) is the Shannon entropy. This property 
can be called small total mixing as it says that the total increase (decrease) of the entropy 
throughout the unitary evolution goes to zero as one of the probabilities po or p\ = 1 — po 
goes to zero. Is there an analogue of the small total mixing property for infinitely small time 
intervals? A naive generalization would be as follows. 

Small Incremental Mixing (SIM): There exists a constant d = 0(1) such that 

A(£ ,H) < c \\H\\ h(p ,pi), h(p ,pi) = -p log(p ) - p\ log(pi) (6) 
for any probabilistic ensemble £ = {p a , p a }a=0,l- 

Here it is meant that the constant d is independent from the dimension of the Hilbert space. 
It is not known whether SIM is true or false. Although SIM might seem completely unrelated 
to ancilla-assisted entangling and SIE, it turns out that SIM is a stronger version of SIE. More 
strictly, we prove that SIM with a constant d implies SIE with a constant c = 4c', see Section [3j 
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Our second result is a proof of SIM for the special case when p = Po Po + Pi Pi has at most 
two distinct eigenvalues (each eigenvalue may have arbitrarily large multiplicity), see Section SJ 
We shall refer to eigenvalues obeying this constraint as a binary spectrum. Our proof yields 
d = 6, although the actual value of d might be smaller. 

The connection between SIE to SIM described in Section [3] has a peculiar property that 
the number of distinct eigenvalues of p in SIM is exactly the same as the number of distinct 
Schmidt coefficients of the initial state \^>) in SIE. Thus a proof of SIM for the case when p 
has a binary spectrum implies SIE for the case when the initial state \%>) has only two distinct 
Schmidt coefficients. In the general case we prove that the mixing rate of any ensemble £ has 
an upper bound A(£,H) = 0(\\H\\), so it does not explicitly depend on the dimension of the 
Hilbert space, see Section For future references let us give an explicit expression for the 
mixing rate: 

K(£,H) = -ipxTr (H[ p^logp]), p = PoPo + Pipi- (7) 

The paper is organized as follows. In Section [2] we find the the maximal entangling rate and 
the optimal pair (*&,H) for the case when Alice and Bob do not use local ancillas. Section [3] 
proves that SIM implies SIE. Section |4] contains the main results of the paper. It proves SIM 
for the case when p has a binary spectrum. Section [5] reports results of numerical maximization 
aimed at verifying SIM. The data obtained in numerical simulations are consistent with SIM. 



2 Maximal entangling rate in the absence of ancillas 

The maximal entangling rate can be easily found in the absence of local ancillas, i.e., when 
a = b = 1. In this case we can always write the initial state of Alice and Bob using the Schmidt 
decomposition \V) = Y^j=iy/Pj\jA) ® \jb) with d = mm(A,B). Denote p A = Z^=iPj U)0'l 
the reduced density matrix of Alice. Using the general formula Eq. (JH) for the entangling rate 
one gets 

r(V,H) = -Trp A log p A = -iTr(H[\*)(%)og p A ® I B ]). (8) 

Since the entangling rate is a linear function of H we can assume that ||i?|| = 1. Using the fact 
that for any Hermitian operator X 

max Tr(HX) = Tr |X| = llXlli, (9) 

H:\\H\\=1 

one can carry out the maximization over H, 

r(*):= max T(V,H) = \\ [logp A ® I B , |*)(*| ] ||i- (10) 

H : \\H\\=1 

For any vectors |$) one has the following identity 



|| |$)(*| - ||i = 2 V /(^|*)($|$) - |(#|$)| 2 . (11) 

Substituting |3>) = (log pa ® Ib) \®) one gets 

d I d \ 2 

r(*) = 2y/F(p), F(p) := log 2 ^- - J>i Iog Pj . (12) 

i=i \i=i / 

We can assume that all pj > (otherwise replace d by d' < d). Then the maximum of F(p) can 
be found by solving extremal point equations dF/dpj = 0. After simple algebra one gets 

d 

-log(2 Pj ) = S± v / ^ + S2, S = ~Y,Pj^g Pj , 1 = Y2' (13) 

i=i n 
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Figure 2: Optimal A (left) and entangling rate T (right) as a function of logd. 



(Here log and In stand for base two and natural logarithm.) The equality for the minus sign in 
the r.h.s. of Eq. (|13j) is possible only for one value of j, since it implies pj > 1/2. Let us agree 
that pi > 1/2. Then — log (2pj) = S + -y/7 2 + S 2 for all j = 2, . . . , d, that is the state pa must 
have a binary spectrum with multiplicities l,d—l. Introduce a variable A such that 
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Let \$f\) be the state with Schmidt coefficients p±, . 
rate with the initial state \*&\) can be written as 



,Pd- Using Eq. (|12p the maximal entangling 



T(^ A ) = 2 v / A(l-A)log 



A(d- 1) 
1-A 



(15) 



The optimal value A = X(d) and the optimal entangling rate = T(^x(d)) have to be found 
by maximizing Eq. (I15p over 1/2 < A < 1. For example, if d = 2, numerical maximization 
yields A(2) as 0.9168 and T2 ~ 1.9123. It coincides with the maximal entangling rate of product 
two-qubit Hamiltonians found in [US]. It follows that under normalization condition \\H\\ = 1 
product two-qubit Hamiltonians are capable of generating entanglement with the largest rate. 

One can easily infer from Eq. (|15p that X(d) ~ 1/2 and ~ logd for sufficiently large d. 
It proves that T(^,H) = 0(\\H\\ logd) in the absence of local ancillas. Moreover, in the limit 
of large d one can explicitly write down the optimal state |^A(d)) an d the optimal Hamiltonian 
Hd. Namely, 



l*A(d)) 



1 

71 



ID + ^I* + ) 



1^" 



Vd~ 



T^ 1 



(16) 



|$+)(1,1| -|1,1)($" 



(17) 



Thus the optimal state is a superposition of a product state and a maximally entangled state 
with locally orthogonal supports. The optimal Hamiltonian is a generator for a rotation in the 
corresponding two-dimensional subspace. 

For a finite d the optimal value X(d) and the corresponding entangling rate can be found 
numerically, see Figure [21 For large d one has X(d) ~ (1/2)(1 + l/ln<i), while entanglement 
entropy of I^a^)) scales as Sd « (1/2) \ogd. 



3 SIM implies SIE 

In this section we assume that SIM is true and show that this assumption implies SIE. Consider 
ancilla-assisted entangling with dimensions a,A,B,b and some initial state l^). Let us assume 
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that B < A. Then SIE is equivalent to an upper bound 

r(*,iJ) < c||ff||logB. (18) 

Since the r.h.s. of this inequality does not depend on a and A, it is enough to prove Eq. (|18|) in 
the case a = 1 since we can always extend A to A <g> a. Now we have a tripartite system ABb 
initially prepared in a pure state |^) and evolving under a Hamiltonian H acting on A and B. 
Using formula Eq. for the entangling rate one gets 

T(V,H) = -iTt{H[p AB ,\ogp A ®I B \) = -iTr(H[p AB ,logr AB ]), (19) 

where p A and p AB are the reduced states of |^), and a state t ab is defined as 

tab = PA® 1 -^- (20) 

In order to define a probabilistic ensemble £ needed for a reduction to SIM we shall need the 
following lemma. 

Lemma 1 For any mixed state p AB there exists a mixed state p AB such that 

I B 1 1 

PA®^ = -g2PAB + {l~-g 1 )V>AB- (21) 

Proof: Since the partial trace is a linear operation, it suffices to prove the lemma for the case 
when p AB = \ip){ifi\ is a pure state. The statement of the lemma is then equivalent to inequality 

\f){i>\<B PA ®I B . (22) 

Let d be the Schmidt rank of l^). Obviously, d < min (A, B). Consider a state \I) = Ylj=i \jA) ® 
where {|ja)} and are the local bases of A and B that diagonalize p A and p B . Then 

\i>)(i>\ = (pi®I B )\I)(I\ (pl®I B ). 

Taking into account that {I\I) = d, one concludes that |/)(/| < dI AB . Multiplying this in- 

i 

equality by p\ <g> I B on the left and on the right we get < dp A ® Ib which implies 



Eq. d22D. 

□ 

Define an ensemble of states £ = {p a , T a } a =o,i such that tq = p A s, t\ = p A B, Po = 1 — B~ 2 , 
pi = B~ 2 . Here p AB is the state that appears in the decomposition Eq. (f2Tj) . This ensemble 
has the average state r = pq tq + p\ t\ = p A (8> Ib/B. Let H be a Hamiltonian that appears in 
Eq. (fT9|) . Assuming that SIM is true one gets 

A(£,H) < c' h(B~ 2 , 1 — B~ 2 ) \\H\\ < Ac'B" 2 log£? \\H\\. (23) 

(Note that for any x < 1/2 one has h(x, 1 — x) < 2x\ logx|.) On the other hand, formula Eq. ([7]) 
for the mixing rate leads to 

A(£, H) = -i Pl Tr (H [ n,logr ]) = -iB~ 2 Tr (H [ p AB ,log r AB }). (24) 

Combining Eqs. ()23I24[) one arrives to 

-iTr (H[p AB ,logr AB }) < 4c' IogB||IT||. 

Comparing it with Eq. (fl~9|) we infer that T(^,H) < 4c' logi? \\H\\. We have proved SIE with 
a constant c = 4c', see Eq. (fl8|) . 

The reduction Eq. (|20p has a peculiar property that the number of distinct eigenvalues of 
r AB and p A are the same. Since we have used identification A = aA, the number of distinct 
eigenvalues of p A coincides with the number of distinct Schmidt coefficients of \*S>). Thus if we 
can prove SIM with p(0) having only two distinct eigenvalues, we will prove SIE for the case 
when |^) has only two distinct Schmidt coefficients. 
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4 Upper bounds on the mixing rate 

Let us start from proving a weaker (compared to SIM) upper bound on the mixing rate. 

Lemma 2 For any ensemble £ and Hamiltonian H one has 

A(£,H) <2\\H\\. (25) 

Proof: Suppose £ = {p a , p a } a =o,i, where the states po, p\ and a Hamiltonian H are defined on 
a D-dimensional Hilbert space. Consider ancilla-assisted entangling protocol shown on Figure [U 
with a = 1, A = D, B = 2, b = D. Choose a Hamiltonian H coupling A and B as 

H = H A ®\1)(1\ B . 

Choose the initial state as 

I*) = Vpo \°)b ® \<Po)Ab + Vpi \1}b ® \<j>l)Ab, 
where \4> a ) G C D (8> C D is a purification of p a . Then the state \ty) evolves in time as 

!*(*)> = V^|0)B ® |<foU + Vnl^B ® (e lHA * ® J 6 ) |0l)A6. 

Accordingly, pa(£) = Po Po + Pi e lHt p\e~ lHt . Therefore A(£,H) = T(^,H). Now we can use 
the upper bound on the entangling rate obtained in [61 [7j for product Hamiltonians, namely 
T(^,H) < P\\H\\ = P\\H\\, where (3 « 1.9123 < 2. The lemma is proved. 

□ 

This lemma implies that the whole difficulty of proving SIM (if it is true) concerns the limiting 
cases po — > or po —> 1. Note also that A(£ , H) = A(£ T , —H), where ensemble <? T is obtained 
from £ by interchanging po, Po with p±, p\. Thus we can assume that < p\ < 1/2. 

In the following we shall represent an ensemble £ = {p a , p a } a =o,i using the data (p,p,H), 
where 

p = Pi, p = (1 -p)po +PPi, 11 = pp'2 Pl p-2. (26) 
One can easily check that a triple (p, p, H) represents some ensemble £ iff 

0<p<l, 0<n</, TiUp = p. (27) 

Indeed, the only non-trivial statement is that H < I. It can be obtained from inequality 
Pi Pi < p by multiplying it by p~ l l 2 on the left and on the right. Using the representation 
Eq. (|26p one can rewrite the mixing rate as 

A(£,H) = -iTr(Hp^[U,logp)p^, (28) 

see Eq. (J7J). Given £, the optimal Hamiltonian H maximizing the mixing rate can be found 
using Eq. Q. Thus we have 

A(£) := max A(S, H) = \\ p^ [ U, logp ] p2 || x . (29) 

: || || = 1 

It is worth mentioning that for given p the upper bound Eq. ([6]) is true for sufficiently small 
p. Indeed, applying the triangle inequality for the trace norm to Eq. ([29]) one gets A(£ ) < 
2|| p2Hp2 log p Since || AB||i < ||vl|| ||i?||i for any operators A,B it follows that A(£) < 
2|| logplHI pznpz ||i. If p is smaller than the smallest eigenvalue of p, one has || logp|| < | logp| 
and thus A(£) < 2| logp| Tr (np) = 2p\ logp| < 2h(p, 1-p). 
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In the rest of the section we prove SIM under the assumption that p has binary spectrum, 



Here m can be arbitrary integer between 1 and D. Define projectors R = X^=iU)01 an d 
i?^ = I — R such that 



p = Aii? + A 2J R^, p2 = VAii?+ VA 2 ^, logp = logAii? + logA 2 i? x . 
After some algebra one gets the following identity 



is [n,io g p]p5 =i g ^/aTa^ii,/? 



(30) 



In order to upper bound the trace norm of the commutator [II, R] we shall use the following. 

Lemma 3 (Holder inequality) Let R be a projector and U be a positive semi- definite oper- 
ator. Then 



\\[IL,R]\\i <2JTr (ILR) Ti (UR X ). (31) 

Lemma 4 Let R be a projector of rank m and IT be a Hermitian operator such that < IT < /. 
Then there exists a Hermitian operator II' such that 

(i) < IT < n, 

(ii) [U,R] = [U',R), 
(Hi) TrlT' < m. 

We shall postpone the proof of the two lemmas above until the end of the section. Let us apply 
Lemma [3] to the commutator in Eqs. (I30p where IT is replaced by IT' from Lemma SI Then we 
can upper bound the mixing rate in Eq. (|29p as 



A(£) < 2 log y/Xi X 2 Tr(U'R) Tr(IT'i^). (32) 

In order to analyze the expression above introduce new variables m! ,p' , xi,X2 such that 

rri = Tr IT', p' = Tr (IT' p), Xi = A; m! . (33) 
Expressing Tr (IT' R) and Tr (IT' R^~) in terms of m' and p' one can rewrite Eq. (I32p as 



A^^^^o^ f Xl \ %i %2 (p' ~ x 2 ) (xi - p') 

A(£ <21og — W -2 =g(xi,x 2 ). (34) 

\x 2 J y (xi - x 2 ) 1 

Let us find constraints on the variables x\, x 2 . Noting that X 2 I < p < X\I and taking the trace 
with IT' one gets x 2 < p 1 < x\. Besides, condition (iii) of Lemma |4] implies that m! < m and 
thus x\ < mX\ < Trp = 1. By obvious reasons one also has x 2 > 0. Summarizing, 

< x 2 < p' < xi < 1. (35) 

Now the problem of getting upper bound on the mixing rate reduces to maximizing a function 
g(x\,x 2 ) in Eq. ([34]) under constraints Eq. ([35]) . We prove (see Lemma [5] at the end of the 
section) that max IliI2 g(x\, x 2 ) < 6p'\ logp'| as long as p' < 1/2. Using condition (i) of LemmaH] 
we get p' = Tr (II'p) < Tr (ITp) = p. As was mentioned in the beginning of the section, we can 
assume that p < 1/2 and thus p' < p < 1/2. Summarizing, we get 

A(£) < 6p'\ logp'| < 6p\ logp| < 6 h(p, 1 — p), 



where we used the fact that a function x\ logx\ is monotone increasing on the interval [0, 1/2]. 
Thus we have proved SIM under the assumption that p has a binary spectrum. 
Proof of Lemma [3} Holder inequality asserts that 



|| XY ||i < ^/Tr (XtX) Tr (YtY) (36) 

for any operators X and y, see [12] . In order to choose proper X and Y let us use an identity 
[n, R] = R^UR — RUR 1 - and triangle inequality for the trace norm: 

|| [II, R] ||i < Hi^ILRHi + \\RHR 1 - ||i = 2||i? ± iLR||i = 2|| (R L U^)(n^R) ||i. 



Substituting X = R II 2 and Y = H^R into Eq. (|36j) and noting that R, R are projectors we 
get the inequality stated in the lemma. 

□ 

Proof of Lemma [4j Any Hermitian operator II satisfying < II < I can be written as a 
convex combination of projectors: 

n = ^ Pa n a , ntn a = n Q . 

a 

Suppose we can find the operator promised in the lemma for every projector in the sum, that 
is we can find Ii' a such that (i) < Ii' a < U a , (ii) [Il' a , R] = [U a , R], and (iii) Tr Ii' a < m. Then 
we can choose the desired operator IT' as IT = J2 a Pa^'a- Thus it suffices to prove the lemma 
for the case when II is a projector. Let D be the dimension of the Hilbert space. Consider a 
direct sum decomposition 

where Ti is the range of R, so that dim(Ti) = m. Then we can write 

R=(i nV n=M c n ). (37) 



j f v ° ] B , 

Here < A < I and < B < I are Hermitian operators on Ti and Ti 1 - and C is some operator 
C : Ti 1 - — > Ti. The requirement that n is a projector implies 

A(I - A) = CC\ B (I — B) = C^C, C(I -B) = AC. (38) 

Consider a decomposition A = A' © A", B = B' © B" where < A' < I and < B' < I, while 
A" and B" have only eigenvalues 0, 1. It follows from Eq. (|38p that 



A" CC ] = CC ] A" = 0, B" C ] C = C ] C B" = 0. 

Thus \ip) =0 for any from the range of A" and C \ <p) = for any \cf)} from the range of 
B". Accordingly, the projector n has the following block structure 



n 



/ A" 
















A' 


a 










(C)t 


B' 







V 








B" 


) 



(39) 



Let n' be the central block in n. Clearly n' is a projector and n' satisfies conditions (i) and (ii) 
of the lemma. It remains to check that Tr n' < m. Let m' be the dimension of the block A' . By 
definition, m! < m. Since n' is a projector, the operators A',B',C obey the same constraint 
as Eq. ([38]), that is 

A' (I -A') = C{C')\ B'(I-B') = (C'yC, C' (I — B') = AC'. (40) 
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Since < A' < I and < B' < I we conclude that C'(C')^ and (C')^C' are non-singular 
matrices and thus the dimensions of the blocks A' and B' both equal to m! . Also from Eq. (|40p 
we infer that A 1 = C (I — B') (C) , that is the spectrum of A 1 coincides with the spectrum 
of I — B' including multiplicities. Accordingly, Trll' = Tr A' + Tr B' = m! < m. Therefore II' 
satisfies all three conditions of the lemma. 

□ 

Lemma 5 Let < q < 1/2 be a real number. Consider a function 



. xi\ x 1 x 2 (q-x 2 )(x 1 -q) 
g(x 1 ,x 2 ) = 2log[ — ' 



X 2 ) V {X\ — X 2 ) 2 

Suppose < x 2 < q and q < X\ < 1. Then g(x\,x 2 ) < 6g|log(/|. 

Proof: Let us consider three cases: 
case 1: 2q < x\ < 1, g/4 < x 2 < q. 

Then log (x\/x 2 ) < log (4/g) < 3| log q| . Using an upper bound x 2 (q — x 2 ) < q 2 /4 we arrive to 



g(xi,x 2 ) < 3g | log g| * / ^li^l — ?2 < 3g | log g| ,/ Xl < 
V (^i - V ( x i ~ 9) 



< 3q\logq\ J Q < 3\/2g|logg|. 
\l 2q-q 

The last inequality follows from monotonicity of a function x±/(xi — q) on the interval (q, oo). 
case 2: q < x\ < 2q, q/4 < x 2 < q. 

First note that log {x\/x 2 ) < log (8) = 3. Take into account that 

(q - x 2 )(x 1 - q) 1 
(xi - x 2 ) 2 ~ 2 

Therefore 

g{x\ ,x 2 ) < 3\/2 ^x\x 2 < 3V2 (2q)q < 6q < 6q | log q\ . 
case 3: q < x\ < 1, < x 2 < q/4. 

Introduce new variable y such that x 2 = qy, that is < y < 1/4. Then 



g(xi,x 2 ) < 2q v / y(l - y) |loggy| k(x{) 

where 

k(xi) = 



xi(x 1 - q) 



y (sci - g/4) 2 ' 

One can check that k{x\) < 2/v3 for all g < xi < 1. Using inequality 



- y) I log 92/1 < Vy I log qy\ = ^Jy | log g| + 2^/y I log y/y\ 
and noticing that i 1 log 1 1 is monotone increasing for 0<t<l/2we conclude that 

1 3 

Vv\ lo Sqy\ < ^\^og(q/4)\ < -|logg| 

for any < y < 1/4. Thus 

g(xi,x 2 ) < 2s/3q | log g|. 

Combining all three cases we get 

g(xi,x 2 ) < 6q\\ogq\. 



□ 
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5 Numerical maximization of the mixing rate 



This section describes numerical simulations aimed at verifying SIM. Let us start from expression 
Eq. (|28|) for the mixing rate and the constraints Eq. (|27|) . It is convenient to represent the 
Hamiltonian as H = 2K - I. Note that \\H\\ < 1 iff < K < I. Then A(£,H) = F(K,U) 
where 

F(K, LT) = -2iTr (k P ^ [IL,log p]p?) = 2iTr (u P ^ [K,]ogp]p*\ (41) 



For a fixed average state p SIM is equivalent to an upper bound 

max F(K,U) < c' h(p, 1 -p), (42) 

where the maximization is subject to 

< K < I, < n < /, Tr (Up) = p. (43) 
We found the maximum of F(K, II) numerically for the average states 

P = P (D) = z- 1 J2r 1 \j)(j\, z=j2r\ m 

3=1 3=1 

with the dimension D = 4, 8, 16, 32 and for several values of p between and 1/2, see Figure 3 
(as was mentioned in Section HI it is enough to consider p between the smallest eigenvalue of p 
and 1/2). 

The motivation for this particular choice of the average state comes from the fact that 
the states p( D ' can "embezzle" any other mixed state for sufficiently large D, see [10] . More 
strictly, it was proved in [10] that for any state p there exists an isometry V such that p^ D > ~ 
W {p {D) ®p)V in the sense that fidelity between the two states goes to 1 as D goes to infinity. In 
particular this is true for the optimal state p corresponding to the optimal ensemble £ = (p, p, II) 
maximizing the mixing rate in Eq. (|29p for a fixed p. On the other hand, the state p( D > (g> p is at 
least as good as p as far as the mixing rate is concerned. Indeed, one can easily verify that the 
mixing rate for the ensemble (p, p, II) is the same as the mixing rate for the ensemble (p, p, II), 
where p = p( D ' (g> p and fl = I (g) II. Thus a verification of SIM for the family of states p^ is a 
good test for general validity of SIM. 

The data obtained in the numerical maximization of F(K, LT) are presented in Figure 3. They 
are consistent with the conjecture Eq. (|42p with a constant d = 1. The numerical algorithm 
that we used is based on reformulation of the maximization problem as a semidefinite program. 
The details of the algorithm are described in Appendix A. 



Acknowledgements: The author thanks Alexei Kitaev for helpful discussions. Numerous 
useful comments from Charles Bennett and John Smolin are acknowledged. This research was 
supported by the NSA and the ARDA through ARO contract number W911NF-04-C-0098. 



Appendix A 

Maximization of the objective function F(K, LT), see Eq. (|4ip . over K for fixed II or vice verse is 
a semi-definite program that can be solved very efficiently. In order to find the global maximum 
F max of F(K, II) we used a sequence of N 3> 1 alternating single- variable maximizations over 
K and over II. To reduce the probability of being trapped in a local maximum, the whole 
procedure was repeated M 1 times with a random choice of initial pair K, II. The algorithm 
outputs the maximal found value of F among all M rounds. The data presented on Figure 3 
were obtained with a choice of parameters N = 32 and M = 128. Based on fluctuations in the 
values of F max found in different rounds we estimate the precision of the algorithm as 5 < 10 -2 . 
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Figure 3: Maximal mixing rate F max = m&xF(K, IT) (vertical axis) computed for a fixed average 
state p = p^ D \ see Eq. ( I44l . and fixed p (horizontal axis). The maximization was performed for 
D = 4,8, 16, 32. The upper curve shows the Shannon entropy h(p, 1—p) = — plogp— (1— p) log (1 — p). 



Maximization of F over K for fixed II reduces to diagonalization of a Hermitian traceless 



operator Z := —2ip2 [ n, log p]p 2 , namely 



max F(K,I1) 

0<K<I 



max Tr (Z K) = — \\Z\\i . 
0<K<I y ' 2 11 11 



The optimal K can be chosen as a projector onto the positive eigensubspace of Z. 

Maximization of F over II for fixed K reduces to a semidefinite program. Define a Hermitian 
traceless operator X = 2ipz [ K, log p ] p? . Then we have to find 



; (p, p) = max Tr (X II) . 
0<n<7, Tr(n p)=p 



(45) 



In practice it is more convenient to solve semidefinite program dual to Eq. (|45|) as it reduces to 
diagonalization of operators and minimization of a convex function of one real variable which 
can be done by the gradient methods. The following lemma shows the connection between the 
primal and the dual problems. 



Lemma 6 Denote 



Ao = ™n(/(p-^) + ~ll*-A,|,, 



(46) 



Let P + and Pq be orthogonal projectors onto the positive eigensubspace and zero eigensubspace 
of an operator X — Xq p respectively. Then F max (p,p) = \q and the optimal operator II can be 
chosen as 

U = P + + xP , (47) 
where the coefficient x is determined from Tr (II p) = p. 

Proof: Let us find semidefinite program dual to Eq. (J45]). The equality Tr(/jII) = p gets 
Lagrangian multiplier A € M. The inequalities II < / and — II < get Lagrangian multipliers 
A > and B > 0. If one can choose A, A, B such that X = X p + A — B (the gradient of the 
objective function is a convex combination of gradients of the constraints), one gets an upper 
bound 

Tr(IIX) = A Tr (np) +Tr(^4n) -Tr(£II) < Ap + Tr A (48) 
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The duality principle asserts that this upper bound is tight if either the primal or the dual 
problems are strictly feasible (all inequalities can be made strict). In our case the primal 
problem Eq. (|45p is strictly feasible for any < p < 1. Indeed, take II = pi. Then < II < I 
and Tr (Up) = p. Thus F max (p, p) coincides with the solution of the dual problem 

minimize (A p + Tr A) subject to < — ^' ^ ~ ^' (49) 



X = Xp + A-B 

Let us carry out the minimization in two stages: first minimize the objective function over A 
and then minimize the resulting (non-linear) function of A. Let A, A, B be any feasible solution 
of the dual problem. The triangle inequality for the trace norm implies that 

Tr A + TrB > \\X - Ap||i. (50) 

Since X is traceless, we also have 

Tt A — TiB = - A. (51) 

Adding Eqs. (|50|5ip together we get 

TvA^-iWX-Xph-X). (52) 

The equality here is achieved when A and B are the positive and the negative parts of X — X p. 
Thus we get 

F m ax(p,p) = min ( X(p - -) + \\\X - Ap||i ) . (53) 



The objective function is a convex one. It grows as (1— p)|A| for A — > — oo and as p X for A — > oo. 
Therefore the minimum is achieved at some finite A which we denote Ao- Equality in Eq. f)48f) 
is possible only if 

Tr(/-n)A = 0, Tr(UB) = 0, Tr (Up) = p. 

One can choose a solution as II = P + + x Pq , where x is to be found from the constraint 
Tr (Up) = p. 

□ 
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